Accession Number |
TCMCG075C23448 |
gbkey |
CDS |
Protein Id |
XP_017980837.1 |
Location |
complement(join(2794435..2794464,2794827..2794898,2795017..2795938,2796212..2796381,2796473..2796493,2796613..2796737,2797743..2797790,2798035..2798156,2798239..2798267,2798498..2798542,2798642..2798829,2799051..2799207,2799295..2799378,2799470..2799563,2799997..2800105,2800465..2800560,2800658..2800730,2800805..2800892,2801493..2801563,2801653..2801726,2801802..2801863,2802792..2802933,2803023..2803123,2803240..2803284,2803401..2803465,2803668..2803743,2804167..2804260,2804375..2804723)) |
Gene |
LOC18591724 |
GeneID |
18591724 |
Organism |
Theobroma cacao |
|
|
Length |
1183 aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA341501 |
db_source |
XM_018125348.1 |
Definition |
PREDICTED: Fanconi anemia group J protein isoform X4 [Theobroma cacao] |
CDS: ATGACGTCTGCAGCAACCGGGGCCAACATCAATCCTAAAACTGTGTATCATATTGCAGGAATTCCGGTGGAATTCCCGTACAAGCCGTACGGGACGCAGTTCACCTTCATGTACAGAGTCATATCAACCCTGGATCGAGCTCAGAAAGACGGTCATTGTCACACCTTACTCGAGTCGCCCACCGGTACCGGCAAATCGCTATCGCTTCTTTGCTCCACTCTGGCCTGGCAACAGAAGTATAAGCTCAAGAATCTCCAAGGCTGTTTGTCTCACTCCACACCTGCACCGGAGGCCATCACCGATCCTCTCGGTCACGGCGGTGGTTTTATCCCAGAAACGCAACCTTCAAGCGTTCCTCCATCAGGGATTTCAGAGCCACCTCAACAATCAGCAAATAGCAAGAATAAGAAGAAAAAGTTGGCTCCTACGATATATTATGCTTCGAGGACACATTCACAAATTTCTCAAGTGATTCGTGAATACAGGAAAACTTCTTACCGAGTGCAAATGGCGGTATTGGCCTCAAGGAAAAATTATTGCACAAATCCACATGTTCTTGGGAAGCAGAATATTGATGAAGAATGCAAGCTGCTTTTGAGCAATCGAGAGGAAGGATGTTTTGAATTTAAAAATATGCATAAAGTCAAATGTCATCCCTCACTTCAGAAAGGAGGCTGTCACGAGGCCCATGATATTGAAGATCTTGTCAAAGTTGGACAAATTGTTAAAGGATGTGCATACTATGCTGCACGTTCCATGGCTGATGATGCGCAATTAGTTTTTTGCCCGTATAGCTACATCATCAATCCTGTTATTCGAGGAGCAATGGATGTAGACATTAAAGGAGCCATTATAGTTCTTGATGAAGCTCACAATATAGAGGATATTGCTCGTGATGCTGGTAGTGTGGATCTTGAAGAAGATGCTTTGCTCAAATTGCAGACGGAGCTAGAGCAGCTCTACATGATTGATGCCACAATTTACCAACCACTGTATGAAATGATACAGGACCTAATGGGTTGGATTGAGCAGAGGAAAAGCACGTTAGAAAAGCATGAATTTCGGCACTACTTCTCCTCCTGGACCGGTGACAAGGCATTAAGACAACTCCAAGAAGCTAACATTTCACAGCAATGCTTCCCCATCTTGCTAGATTGTGCAACAAAGGCAATCAAAGCTGCTTCAGATACAGAGTCAGATGCACCTCATTTAAGTGGCATGTCTGTCATAACATTGGAAGGGCTGTTCTCTTCACTGACATATTTCTTCTCAAGGAATGGTTCTCATATATTTGATTATCAGCTTGCTCTACAAAGATATGTTAAAAAGGATGGCGAAAATGCTTTTGGCAATTGGACATGTTCTTTAAGTTTGTGGTGCTTGAATCCAGCGGTTGTTTTCAGAGACATTGCTGAACTTTCATTGTCTGTCATTTTAACATCTGGGACATTGTCACCAATGAATTCCTTCTCATCTGAACTTGGAGTTCAGTTTGGAAACTGTTTGGAGGCTCCTCATGTTATTGATGTTAAATCCCAGGTGTGGTCTGCTATAATCTCCCATGGTCCAGGTAATTATCGATTGGATGCAAGTTACAAAACAGCAGATACATATGCTTTCCAGGATGCACTCGGCAAATCCTTAGAAGAAATTTGCAAGATTGTTCCAGGAGGCTCTCTGATTTTCTTCCCTAGTTACAAGCTGATGGAGAAACTTTGCAAGCGTTGGCGTGAAACAGGCCAATGGTTGCGGCTTAATGCAAGAAAGCGCCTTTTTGTTGAGCCAAGAGGAGGTAATCAGGAAGAATTTGACACTGTTCTGAAAGGTTATTATGATTCAGTATCTGGAGGCAAGGAACCTGGTCTTGGGAGAAAGAAAAGGATTAAAAAAGCAGATCACAATGCAATTGAGTCCACAGAAGTTACTAACCATGAAGGTGCTGCTTTTCTTGCAGTTTTCCGAGGAAAGGTCTCAGAAGGCATAGATTTTTCTGAT
GATAATGCTAGAGTGGTGATAGTTGTTGGCATTCCCTTTCCAAATATAAATGATATTCAGATTGCTTTAAAGAAGAAATATAATAACACTTACAAATCATCTAAAAACCTTCTCAGTGGAAGTGAGTGGTACTGCCACCAAGCCTTCAGAGCTCTAAATCAAGCTCTTGGACGCTGTATACGTCATAAATTTGATTATGGAGCCATCATCCTATTAGATTCACGTTTTCAAGAAGAAAAGAATAGAGCTTACATTTCAAAGTGGCTAAGGCCATCCGTTAGAAGCTACGAAAGTTTTGAGAAGTCATTAGAGGAGCTAAGATCCTTTTTCAGAGATGTCAAGGACCTGGTTAGCAAGAACAAGTCTGATCATTGTGGACAGCAACTACTTCCTTTGGCAAAATATGATGCCACCTTTCCCCAGATGAAGCCTCAAATTGATCTTGCAATTCAAACAAGTGTGCAAGTAGATAAGGACCAGAAGACATGCAAGGAATATATTGATCTAGAATGTAGCTCTCCAAAAGATTCCCGGTGTTTGGAGACTTTATCTATGACATATTCCAATGAAGATCCAGATTTACTCATGGTCAAGGAAACACCTCTTGTGGATTCCAGTATCTCTCTAGCTAGTCCTGGTTCGTTATCTAAGGATGGGAACTCTGGTTCAACTATAATTCAGGCCTCAACTAAGTCTTTGGATCAATTTTTAATTCATCCAATGTCTTCAACAAGCCTGAACGAAGTTCCTTCCTCTTATGAATCTGTGACAATGTTCACCCCTGAAAAAGACGTCAATCAAAACACCTCCAGCCTGATACCAAAAATAGAATCACCTTCGAATCTTAGTGTTTGTTCTTATATTCATAAAAGGAGGGAATCCATGGGTTCACCTTTCATTAACCTTGTTAATGAAGAAGGATCTGATACTCCTGCGCAAATTCCTGGTTCTATGAGTTTCAAGGATAACTTACTAGCGAACAGGCGCAGAATTGAATTTGGTTTTGAGCCTTGTTCAGTAGAAAACAAACAGAAGAAATCAAATGTTCCTCAGTCGTTGGCAACTCACAATTCTTGTCCTGCTATGGATAAGAGATTACAAATTTCTTGTTTACTCTGCAGAAGCCCTTTAGGCCGCCCTGAAAATCATCTGTATCTCAGTTGCTTGTTGACTGTGTCATCAAAAGTTTACCTGTTATCTCTTATAAAAGAGAGATTCATTTGTTGTTCTTCAAATACACTAACAACCGTACCAGTTATAATAACTGACATTTCATCAGTTGATCCAAGACTCTGCAATAGAACTCTTGAAGGTGACACAGGACAGGGCATTTGGCATGAAGATGATGGGTGTGTATTCAAAAATGTCTTTTGCCCCTTCTGCAGTAGCCGTAACAATTGTCTTGGTGTGCAAATCAAAGCGGCTGATGAAAACAATGTTAAGTGGCTAAAGAAGATACTGCTTTTCACTGATTGTGTGGAAATTAGAAATTCTGAAGCAGCAGAGGACAAGGCAGCAAAGGATAAGCACATGGTTATAGGATTTGTCGCAGGTTGA |
Protein: MTSAATGANINPKTVYHIAGIPVEFPYKPYGTQFTFMYRVISTLDRAQKDGHCHTLLESPTGTGKSLSLLCSTLAWQQKYKLKNLQGCLSHSTPAPEAITDPLGHGGGFIPETQPSSVPPSGISEPPQQSANSKNKKKKLAPTIYYASRTHSQISQVIREYRKTSYRVQMAVLASRKNYCTNPHVLGKQNIDEECKLLLSNREEGCFEFKNMHKVKCHPSLQKGGCHEAHDIEDLVKVGQIVKGCAYYAARSMADDAQLVFCPYSYIINPVIRGAMDVDIKGAIIVLDEAHNIEDIARDAGSVDLEEDALLKLQTELEQLYMIDATIYQPLYEMIQDLMGWIEQRKSTLEKHEFRHYFSSWTGDKALRQLQEANISQQCFPILLDCATKAIKAASDTESDAPHLSGMSVITLEGLFSSLTYFFSRNGSHIFDYQLALQRYVKKDGENAFGNWTCSLSLWCLNPAVVFRDIAELSLSVILTSGTLSPMNSFSSELGVQFGNCLEAPHVIDVKSQVWSAIISHGPGNYRLDASYKTADTYAFQDALGKSLEEICKIVPGGSLIFFPSYKLMEKLCKRWRETGQWLRLNARKRLFVEPRGGNQEEFDTVLKGYYDSVSGGKEPGLGRKKRIKKADHNAIESTEVTNHEGAAFLAVFRGKVSEGIDFSDDNARVVIVVGIPFPNINDIQIALKKKYNNTYKSSKNLLSGSEWYCHQAFRALNQALGRCIRHKFDYGAIILLDSRFQEEKNRAYISKWLRPSVRSYESFEKSLEELRSFFRDVKDLVSKNKSDHCGQQLLPLAKYDATFPQMKPQIDLAIQTSVQVDKDQKTCKEYIDLECSSPKDSRCLETLSMTYSNEDPDLLMVKETPLVDSSISLASPGSLSKDGNSGSTIIQASTKSLDQFLIHPMSSTSLNEVPSSYESVTMFTPEKDVNQNTSSLIPKIESPSNLSVCSYIHKRRESMGSPFINLVNEEGSDTPAQIPGSMSFKDNLLANRRRIEFGFEPCSVENKQKKSNVPQSLATHNSCPAMDKRLQISCLLCRSPLGRPENHLYLSCLLTVSSKVYLLSLIKERFICCSSNTLTTVPVIITDISSVDPRLCNRTLEGDTGQGIWHEDDGCVFKNVFCPFCSSRNNCLGVQIKAADENNVKWLKKILLFTDCVEIRNSEAAEDKAAKDKHMVIGFVAG |
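
The fields in this record can be cross-checked against each other. The sketch below is illustrative and not part of the source database: the names `spliced_length` and `translate` are hypothetical helpers, and the translation assumes the standard genetic code (NCBI translation table 1), which the record itself does not state. It sums the exon spans in a `join(...)` location and translates a CDS string codon by codon.

```python
import re

# Standard genetic code (NCBI table 1), assumed here; codons are ordered
# with bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def spliced_length(location: str) -> int:
    """Sum the lengths of all start..end spans in a feature location string.

    Coordinates are 1-based and inclusive, so each span contributes
    end - start + 1 bases. Strand (complement) does not affect the length.
    """
    spans = re.findall(r"(\d+)\.\.(\d+)", location)
    return sum(int(end) - int(start) + 1 for start, end in spans)


def translate(cds: str) -> str:
    """Translate a CDS into one-letter amino acids; '*' marks a stop codon.

    Codons containing characters other than T, C, A, G become 'X'.
    Trailing bases that do not fill a complete codon are ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        codon = cds[i:i + 3]
        try:
            index = (16 * BASES.index(codon[0])
                     + 4 * BASES.index(codon[1])
                     + BASES.index(codon[2]))
            protein.append(AMINO_ACIDS[index])
        except ValueError:
            protein.append("X")
    return "".join(protein)


if __name__ == "__main__":
    # First three exon spans of the Location field above.
    partial = "complement(join(2794435..2794464,2794827..2794898,2795017..2795938))"
    print(spliced_length(partial))          # 1024

    # First 18 bases and last 12 bases of the CDS field above.
    print(translate("ATGACGTCTGCAGCAACC"))  # MTSAAT
    print(translate("GTCGCAGGTTGA"))        # VAG*
```

Applied to the complete Location value, the 28 spans sum to 3552 bases, that is 3 × 1184 codons, which agrees with the stated length of 1183 aa plus one stop codon. The translated fragments likewise match the start (`MTSAAT`) and end (`VAG`) of the Protein field.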